Recommending Anchor Points in Structure-Preserving Hypertext Document Retrieval

Authors

  • Ben Kao
  • Joseph K. W. Lee
  • David Wai-Lok Cheung
  • Chi-Yuen Ng
Abstract

Traditional WWW search engines index and recommend individual Web pages to assist users in locating relevant documents. Users are often overwhelmed by the large answer set recommended by the search engines. The logical starting point of a hyper-document is thus hidden among the large basket of matching pages. Users must spend considerable effort browsing through the pages to locate the starting point, a very time-consuming process. This paper studies the anchor point indexing problem. The anchor points of a given user query are a small set of key pages from which the larger set of documents relevant to the query can be easily reached. The use of anchor points helps solve the problems of huge answer sets and low precision suffered by most search engines by considering the hyperlink structure of the relevant documents and by providing a summary view of the result set.
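
To make the idea of anchor points concrete, the following is a minimal sketch only, not the method of the paper (the abstract does not describe the algorithm). It assumes that the matching pages and the links between them are already known, that "easily reached" means reachable within a fixed number of link traversals through matching pages, and that a greedy set-cover heuristic is an acceptable way to pick a small set. The names `reachable_within`, `greedy_anchor_points`, and `max_hops`, and the example pages, are hypothetical.

```python
from collections import deque


def reachable_within(links, start, matches, max_hops):
    """Matching pages reachable from `start` in at most `max_hops` link
    traversals, following links only through matching pages (an assumption)."""
    seen = {start}
    queue = deque([(start, 0)])
    while queue:
        page, hops = queue.popleft()
        if hops == max_hops:
            continue  # do not expand beyond the hop limit
        for target in links.get(page, ()):
            if target in matches and target not in seen:
                seen.add(target)
                queue.append((target, hops + 1))
    return seen


def greedy_anchor_points(links, matches, max_hops=2):
    """Greedy set cover: repeatedly pick the matching page that reaches the
    most still-uncovered matching pages. A heuristic, not an optimal cover."""
    matches = set(matches)
    coverage = {p: reachable_within(links, p, matches, max_hops) for p in matches}
    uncovered = set(matches)
    anchors = []
    while uncovered:
        # sorted() makes tie-breaking deterministic (alphabetical order)
        best = max(sorted(matches), key=lambda p: len(coverage[p] & uncovered))
        anchors.append(best)
        uncovered -= coverage[best]
    return anchors


# Hypothetical result set: two small hyper-documents matched by one query.
links = {
    "home": ["intro", "api"],
    "intro": ["install"],
    "api": ["api/search"],
    "blog": ["blog/post1"],
}
matches = {"home", "intro", "install", "api", "api/search", "blog", "blog/post1"}

print(greedy_anchor_points(links, matches, max_hops=2))
```

In this toy example the seven matching pages collapse to two suggested starting points, one per hyper-document, which is the kind of summary view the abstract describes. The hop limit and the restriction to links between matching pages are illustrative choices; other definitions of reachability would give different anchor sets.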

Related resources

Anchor point indexing in Web document retrieval

Traditional World Wide Web search engines, such as AltaVista.com, index and recommend individual Web pages to assist users in locating relevant documents. As the Web grows, however, the number of matching pages increases at a tremendous rate. Users are often overwhelmed by the large answer set recommended by the search engines. Also, if a matching document is a hypertext, the document structure...

Modelling Anchor Text Retrieval in Book Search based on Back-of-Book Index

This paper proposes a probabilistic logic abstraction for modelling tf-boosting approaches to anchor text retrieval, adapted for the task of page-search in books. The underlying idea is to view the back-of-book index (BoBI) as a list of anchors pointing to pages in the book. First, we model the direct application of hypertext-based tf-boosting to books and show that this naive method of propaga...

Document Representation and Query Expansion Models for Blog Recommendation

We explore several different document representation models and two query expansion models for the task of recommending blogs to a user in response to a query. Blog relevance ranking differs from traditional document ranking in ad-hoc information retrieval in several ways: (1) the unit of output (the blog) is composed of a collection of documents (the blog posts) rather than a single document, ...

Dynamic Hypertext Synthesis for Information Retrieval

Hypertext navigation alone is insufficient for efficient Information Retrieval (IR). Previous attempts to combine IR techniques with hypertext have been confined to the pre-authored structure of a document. In this paper we extend computer-science methods to synthesize a tailor-made hypertext document in response to each user's query. The synthesis technique can also be used to automatically crea...

TACHIR: A Tool for Automatic Construction of Hypertexts for Information Retrieval

The paper describes the design and implementation of TACHIR, a prototype tool for the automatic construction of hypertexts for Information Retrieval. TACHIR automatically builds an IR hypertext, that is, a hypertext to be used for information retrieval, from a document collection, using a methodology based on a set of well-known Information Retrieval techniques. The structure of the IR hyper...

Publication date: 1998